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Abstract 



Known strategies for sending bits at the capacity rate over a general channel with classical input 
and quantum output (a cq channel) require the decoder to implement impractically complicated 
collective measurements. Here, we show that a fully collective strategy is not necessary in order to 
recover all of the information bits. In fact, when coding for a large number A'^ uses of a cq channel 
3^ W, N-I{Wacc) of the bits can be recovered by a non-collective strategy which amounts to coherent 

quantum processing of the results of product measurements, where /(Wacc) is the accessible 
information of the charmel W. In order to decode the other N{I{W) — /(Wacc)) bits, where I{W) 
is the Holevo rate, our conclusion is that the receiver should employ collective measurements. We 
also present two other results: 1) collective Fuchs-Caves measurements (quantum likelihood ratio 
measurements) can be used at the receiver to achieve the Holevo rate and 2) we give an explicit 
■y form of the Helstrom measurements used in small-size polar codes. The main approach used to 

^ demonstrate these results is a quantum extension of Arikan's polar codes. 
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^ fT^ Introduction 



One of the most impressive recent developments in coding theory is the theory of polar 
codes These codes are provably capacity achieving, and their encoding and decoding 
complexities are both O(A^logA^), where iV is the number of channel uses. Polar codes 
are based on the channel polarization effect, in which a recursive encoding induces a set 
of TV synthesized channels from N instances of the original channel, such that some of the 
synthesized channels are nearly perfect and the others are nearly useless. The fraction of 
synthesized channels that is nearly perfect is equal to the capacity of the channel, and thus 
the coding scheme is simple: send the information bits through the synthesized channels that 
are nearly perfect. 

An essential component of the polar coding scheme is Arikan's successive cancellation 
decoding algorithm [1 . This algorithm is channel dependent and operates as its name suggests: 
it decodes the information bits one after another, using previously decoded information to aid 
in constructing a test for decoding each bit in succession. In particular, the test for decoding 
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each information bit is a likelihood ratio test. Due to the structure in the polar encoder, 
there is a great deal of structure in the decoding tests, so much so that each likelihood ratio 
can be recursively computed. The upshot is that the complexity of the decoding algorithm is 
0{N log N). 

Recently, there has been some effort in extending the theory of polar coding to the problem 
of transmission over quantum channels [231 1181 1261 125] . In particular, these works developed 
the theory of polar coding for transmitting classical data over an arbitrary quantum channel 
j23j , private classical data over an arbitrary quantum channel |1S], quantum data over a 
quantum Pauli or erasure channel [TH] , and quantum data over an arbitrary quantum channel 
|26j . To prove that the polar coding schemes in Refs. j231 achieve communication 

rates equal to well-known formulas from quantum information theory, the authors of these 
works constructed a quantum successive cancellation decoder as a sequence of quantum 
hypothesis tests (in the spirit of Arikan [IJ) and employed Sen's non-commutative union 
bound [20 in the error analysis. The major question left open from this effort is whether 
there exists an efficient implementation for a quantum successive cancellation decoder ^ 

In this paper, we detail our progress towards finding an efficient quantum successive 
cancellation decoder. The decoder outlined here is useful for decoding classical informa- 
tion transmitted over a channel with classical inputs and quantum outputs (known as a 
"classical-quantum channel" or "cq channel" for short). Since the schemes for private clas- 
sical communication [5S] and quantum communication [55] rely on the quantum successive 
cancellation decoder from Ref. [23], our results here have implications for these polar coding 
schemes as well. Our main result can be stated succintly as follows: 

► Claim 1. In order to achieve the symmetric Holevo capacity I{W) of an arbitrary cq 
channel W , at most N{I{W) — I{Wacc)) of the hits require a fully collective strategy in order 
for them to he decoded reliahly, while the other N ■ I{Wacc) hits can he decoded efficiently 
and reliahly in time 0{N'^) on a quantum computer using a product strategy that amounts to 
coherent quantum processing of the outcomes of product measurements. 

Although the main result of this paper might be considered modest in light of reaching 
the full goal stated above, it still represents non-trivial progress beyond prior research and 
towards answering the efficient polar decoding question. Indeed, one might think that 
collective measurements would be necessary in order to recover any of the bits of a message 
when communicating at the Holevo capacity rate, as suggested by the original work of Holevo 
jl5j , Schumacher, and Westmoreland [TS] and follow-up efforts on the pure-loss bosonic 
channel [HI [S] . Even the recent sequential decoding schemes suggest the same [7] [2D] (see also 
[21] for the pure-loss bosonic case). As a side note, these sequential decoding schemes require 
a number of measurements exponential in the number of channel uses — thus, even though 
the physical realization of a single one of these measurements may be within experimental 



^ By efficient, we mean that the decoder should run in 0{N'^) time on a quantum computer (or even 
better 0{N\ogN)). In computational complexity theory, "efficient" is often regarded to mean that 
an algorithm runs in time polynomial in the input length. However, for the demanding application of 
channel coding where delay should be minimized, we will consider a decoding algorithm to be "efficient" 
if it has a near-linear running time. 

^ Note that the scheme from Ref. | 18] does provide an efficient 0(N log N) implementation of a quantum 
successive cancellation decoder, essentially because sending classical states (encoded in some orthonormal 
basis) through a Pauli or erasure channel induces an effectively classical channel at the output (such 
that the resulting output states are commuting). One can then exploit a coherent version of Arikan's 
successive cancellation decoder to decode quantum information. Although this advance is useful, we 
would like to have an efficient decoder for an arbitrary quantum channel. 



M. M. Wilde, O. Landon-Cardinal, and P. Hayden 



reach [17j , the fact that these schemes require an exponential number of measurements 
excludes them from ever being practical. The previous result in Ref. |23j suggests that only 
a linear number of collective measurements are required to achieve the Holevo rate, and 
our work here demonstrates that the number of collective measurements required is at most 
NiIiW)-I{W,,,)). 

This paper contains other results of interest. First, we prove that collective Fuchs-Caves 
measurements (or quantum likelihood ratio measurements) [5] suffice for achieving the Holevo 
information rate with a cq polar coding scheme. It was already known from Ref. [23j that a 
sequence of Helstrom measurements suffices for achieving this rate, so this new result just 
adds to the ways in which one can achieve the Holevo rate of communication. We also plot 
the fraction of requisite collective measurements as a function of the mean photon number of 
the signaling states for the case of the pure-loss bosonic channel, in order to have a sense 
of the physical requirements necessary for high-rate communication over this channel. As 
one would expect, the fraction of collective measurements needed increases as the mean 
photon number of the signaling states decreases — we expect this to happen since the low 
photon-number regime is more quantum due to the non-orthogonality of the signaling states. 
Finally, we detail the explicit form of a polar decoder that uses Helstrom measurements — we 
do this for some simple two-, four-, and eight-bit polar codes. This final result should give a 
sense of how one can specify these tests for larger blocklength polar codes. 

The paper is organized as follows. The next section reviews background material such 
as cq channels, the Holevo quantity, quantum fidelity, the accessible information, and the 
classical fidelity (Bhattacharya parameter). Section [s] reviews the Fuchs-Caves measurement 
from Ref. and provides a useful upper bound on the error probability of a hypothesis test 
that employs this measurement as the decision rule. We review classical-quantum polar codes 
in Section [4.1[ Our first simple observation is that collective Fuchs-Caves measurements 



suffice for achieving the Holevo rate of communication (Section 4.2 1. Our main result, a 
justification for Claim [T] appears in Section [4. 3| In Section [5] we discuss the implications 
of Claim [T] for the pure-loss bosonic channel. Our last result on the explicit form of the 
Helstrom decoder for two-, four-, and eight-bit polar codes appears in Section [6] Finally, 
we conclude with a summary of our results and suggest that the Schur transform might be 
helpful in obtaining a general solution to the problem discussed in this paper. 

[2 Preliminaries 

A classical-quantum channel (cq channel) has a classical input and a quantum output. In 
this work, we only consider cq channels with binary inputs, written as 

W:x^p,, (1) 

where W labels the channel, the input x € {0, 1}, and px is a density operator. The symmetric 
Holevo information of this channel is 

I{W) = H{{po + pi)/2) - [H{po) + H{p,)]/2, (2) 

where H{a) = — Tr{(T log2 a} is the von Neumann entropy. The symmetric Holevo information 
gives one way to characterize the quality of a cq channel for data transmission: it is equal 
to one if po is orthogonal to pi and equal to zero if po — pi- The quantum fidelity F{W) is 
another parameter that characterizes the quality of a cq channel: 

F{W) = F{p„, Pi) ^W^oVp'iWi, (3) 
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where the trace norm of an operator A is defined as ee Tv{VA^} mUgj^ The 

quantum fidelity F{W) is equal to one if po — Pi and equal to zero if po is orthogonal to 
pi. We have the following relationships between the symmetric Holevo information and the 
quantum fidelity: 

I{W) w 1 ^ F{W) « 0, (4) 
I{W)kQ<^ F{W)kI^ (5) 

which are made precise in Proposition 1 of Ref. [23] . 

From any cq channel, it is possible to induce a purely classical channel PY\x{y\x) by 
having the receiver perform a quantum measurement at its output: 

PY\x{y\x) = T:T:{J^yPx}, (6) 

where A = {A^} is a positive operator- valued measure (POVM), a set of operators satisfying 
Ay > and Ky = I. Letting X be a uniform Bernoulli random variable and letting Y be 
the random variable corresponding to the outcome of the measurement, we can define the 
symmetric mutual information of the induced channel as 

I{W, A) = I{X- Y) = H{X) + H{Y) - H{XY) , (7) 

where H is the Shannon entropy of these random variables. The classical Bhattarcharya 
parameter is the statistical overlap between the resulting distributions: 

z{w, A) = ^ ^PY\x{y\Q) PY\x{y\i). (8) 

V 

If one were to encode the conditional distribution PY\x{y\x) along the diagonal of a matrix 
(so that it becomes a density operator), then it is clear that the symmetric Holevo information 
and fidelity of the resulting "cq channel" are equal to the symmetric mutual information and 
classical Bhattacharya parameter, respectively. 

The symmetric accessible information is equal to the optimized symmetric mutual 
information: 

/(W^acc) = max/(T4^,A), (9) 

{Ay} 

where the optimization is with respect to all POVMs A = {Aj,}. As a consequence of the 
well-known Holevo bound, the symmetric Holevo information is an upper bound to the 
symmetric accessible information (14j : 

/(W^acc) < I{W). (10) 

3 The Fuchs-Caves Measurement 

Rather than choosing a measurement to optimize the symmetric mutual information, one 
could also choose a measurement in such a way that it minimizes the statistical overlap 



^ Note that the quantum fidehty sometimes is defined as || >/Po>/pr|| i in order for it to have the 
interpretation as a probabihty. We choose to remove the square in this work (as is often done) 
in order for it to reduce to the classical Bhattacharya parameter when the states are just probability 
distributions. 
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between the resulting distributions PY\xiy\^) PY\xiy\^) [S]- We call such a measurement 
a "Fuchs-Caves" measurement since these authors proved that the minimum statistical 
overlap is equal to the quantum fidelity: 

min Z {W, A) = F{W). (11) 
{A„} 

Furthermore, they gave an explicit form for the measurement that achieves the minimum 
and interpreted it as a kind of "quantum likelihood ratio." Indeed, the measurement that 
achieves the minimum in (111 corresponds to a measurement in the eigenbasis of the following 
Hermitian operator: 



,, -1 -1/2 / 1/2 1/2 -1/2 

/90#Pi =Pi ypi PoPi Pi ■ (12) 

Diagonalizing pa # p^^ as 

po* Pi'^J2^y\y)(y\^ (13) 



Fuchs and Caves observed that the eigenvalues of po # Pi ^ take the following form: 

furthermore suggesting that this measurement is a good quantum analog of a likelihood ratio. 
In addition, Fuchs and Caves also observed that the operator 



-1 -1/2 / 1/2 1/2 -1/2 /-, r-N 

Pi # Po =Po yPo PlPo Po (15) 

commutes with and is the inverse of po if Pi^ ■ Thus, the eigenvectors of pi ^ p^^ are the 
same as those of po # ^^"^ eigenvalues are the reciprocals of those of po # PF^- 

► Lemma 2. When using the Fuchs-Caves measurement to distinguish pq from pi, we 
have following upper bound on the probability of error pe(W) in terms of the quantum 
fidelity F{W): 

Pe{W) < lF{W). (16) 



Proof. After performing the measurement specified by (13 1, the decision rule is as follows: 

decide pq if Xy > 1, (17) 
decide pi if Xy < 1, (18) 

which corresponds to the projectors 

no^ E \y)(y\^ (19) 

y ■■ ^B>i 

ni= E \y)(y\- (20) 

y ■■ >^y<^ 
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It is then easy to prove the bound in ( 16 1 : 
2 pe{W) = Tr{nopi} + Tr{nipo} 



y ■■ A„>1 y : \y<l 

E {y\pi\y)'^'{y\pi\yy^'+ E iy\po\y)'^'{y\po\yy^' 



y ■■ >'!/>i 

< E iy\pi\yy^'{y\p^\yy^'- 

y ■■ >'H>i 

= Y.^v\pi\v)"^vW\yY" 

V 



V ■■ Aj,<l 

E {y\po\y)''^{v\piW^ 



(21) 
(22) 

(23) 

(24) 

(25) 
(26) 



where the last equahty follows from (111 



4 Decoding Classical-Quantum Polar Codes 
4.1 Review 



Ref. 



demonstrated how to construct synthesized versions of W , by channel combining 



and splitting [T] . The synthesized channels VF^^ are of the following form: 



P(i).u. 



(27) 
(28) 



^gN ^ ^ 1 B 



'Gn' 



Pa™ = Px^ 



(29) 



where Gat is Arikan's encoding circuit matrix built from classical CNOT and permutation 
gates. The registers labeled by V are classical registers containing the bits u\ through 
and the registers labeled by B contain the channel outputs. If the channel is classical, then 
these states are diagonal in the computational basis, and the above states correspond to the 
distributions for the synthesized channels [T]. The interpretation of is that it is the 
channel "seen" by the input Ui if the previous bits w^^"^ are available and if the future bits 
"uf^i are randomized. This motivates the development of a quantum successive cancellation 
decoder |23j that attempts to distinguish ui — Q from = 1 by adaptively exploiting the 
results of previous measurements and quantum hypothesis tests for each bit decision. 

The synthesized channels w'^^ polarize, in the sense that some become nearly perfect 
for classical data transmission while others become nearly useless. To prove this result, one 
can model the channel splitting and combining process as a random birth process [TJ [23] , 
and then demonstrate that the induced random birth processes corresponding to the channel 
parameters /(VF^'') and i^(W^^) are martingales that converge almost surely to zero-one 
valued random variables in the limit of many recursions. The following theorem characterizes 
the rate with which the channel polarization effect takes hold |H US] , and it is useful in 
proving statements about the performance of polar codes for cq channels: 

► Theorem 3. Given a binary input cq channel W and any j3 < 1/2, it holds that 



lim Pr {F{W^i^) < 2-^ 

n— >-oo J 



(30) 
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where n indicates the level of recursion for the encoding, VFj^'' is a random variable charac- 
terizing the J*^ split channel, and F{W2i^) is the fidelity of that channel. 

Assuming knowledge of the identities of the good and bad channels, one can then construct 
a coding scheme based on the channel polarization effect, by dividing the synthesized channels 
according to the following polar coding rule: 

gM{W,P) ^ e [N] : < 2-^'}, (31) 

BN{W,P) = [N]\gr,{W,P), (32) 

so that Qn{W^ P) is the set of "good" channels and Bn{W, (3) is the set of "bad" channels. 
The sender then transmits the information bits through the good channels and "frozen" bits 
through the bad ones. A helpful assumption for error analysis is that the frozen bits are 
chosen uniformly at random and known to both the sender and receiver. 

One of the important advances in Ref. [23] was to establish that a quantum successive 
cancellation decoder performs well for polar coding over classical-q uant um channels with 
equiprobable inputs. Corresponding to the split channels W^^ in (27 1 are the following 
projectors that attempt to decide whether the input of the i*^ split channel is zero or one: 



(33) 

n;:ir"-/-n;:i;^", (34) 

where {B > 0} denotes the projector onto the positive eigenspace of a Hermitian operator B. 
After some calculations, one readily sees that 

nS?"-Ehr)(-rr^"^«n-V_,^, (35) 





i-l 


where 






_j_^urB- 

— ^ {i),0 ' 




, dN rJV 


„N 





(36) 

)}, (37) 

(38) 

The observations above lead to a decoding rule for a successive cancellation decoder similar 
to Arikan's [T]: 



where h(^u\ ^) is the outcome of the i"^ collective measurement: 

{nS".;-o' ^^.^i} (40) 

on the codeword received at the channel output (after i — 1 measurements have already 
been performed). The set A labels the information bits. The measurement device outputs 
"0" if the outcome 11^^ ^^-iq occurs and it outputs "1" otherwise. (Note that we can set 

^{i) ~ ^ frozen bit.) The above sequence of measurements for the 
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whole bit stream corresponds to a positive operator- valued measure (POVM) {A„n} 
where 



A„]v = H/-, ^ • • • n, i_i • • • n, , jv-i • • • n, i„i • • • n,--, , (4i) 

and X^n^ — I ■ The probability of error Pe{N, K, A, ) for code length N , number K 
of information bits, set A of information bits, and choice u^c for the frozen bits is 

P,{N,K,Au^.) - 1 - -^^Tr{A„«p„«}, (42) 



where we are assuming a particular choice of the bits uj^c in the sequence of projectors 



• • nf, • • • nf^> and setting Ilf, . , = / if u,- is a frozen bit. The 

formula also assumes that the sender transmits the information sequence with uniform 
probability 2^^. The probability of error averaged over all choices of the frozen bits is then 

Pe (N, K, A) = 5] (iV, K, A, UA^ ). (43) 



The following proposition from Ref. |23j determines an upper bound on the average ensemble 
performance of polar codes with a quantum successive cancellation decoder: 

► Proposition 4. For any classical- quantum channel W with binary inputs and quantum 
outputs and any choice of{N,K,A), the following bound holds 



P,{N,K,A)<2 J2kFiwl;^). (44) 

The proposition is proved by exploiting Sen's non-commutative union bound [20] and 
Lemma 3.2 of Ref. pjj (which upper bounds the probability of error in a binary quantum 



hypothesis test by the fidelity between the test states). The bound in (44 1 applies provided the 
sender chooses the information bits Ua from a uniform distribution. Thus, by choosing the 
channels over which the sender transmits the information bits to be in A and those over which 
she transmits agreed-upon frozen bits to be in A'^, we obtain that the probability of decoding 
error satisfies Pr{J74 ^ C/^} ~ o(2~3^''), as long as the code rate obeys R = K/N < I{W). 

A final point that will be useful is that Ref. [23 also proved that measurements consisting 
of the projections 



^(4,0 ~]|P{^].1 (45) 

rather than those in (33)-(34l, also achieve the performance stated in Proposition |4] 

4.2 Collective Fuchs-Caves Measurements Achieve the Holevo Rate 

Our first observation is rather simple, just being that collective Fuchs-Caves measurements 
can also achieve the performance stated in Proposition [4] This result follows from Lemma |2]s 
bound on the error probability of a Fuchs-Caves measurement and by performing an error 
analysis similar to that in the proof of Proposition 4 of Ref. [23] given in Section V of that 
paper. The explicit form of a Fuchs-Caves quantum successive cancellation decoder is given 



by projectors of the form in (35)-(38l, with the Helstrom tests replaced by Fuchs-Caves 



projectors as given in (19l-(20) 
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This result also demonstrates that there are a variety of decoding measurements that 
one can exploit for achieving the Holevo information rate. However, the quantum successive 
cancellation decoder consisting of Helstrom measurements should outperform either the 



measurements in (45 1 or the Fuchs-Caves measurements when considering finite blocklength 
performance because the Helstrom measurement is the optimal test for distinguishing two 
quantum states. 

4.3 Main Result 

Our main observation is a bit more subtle than the above, but it is still elementary. Nev- 
ertheless, this observation has nontrivial consequences and represents a step beyond the 
insights in prior work regarding decoding of classical information sent over quantum channels 

We begin by considering the "Fuchs-Caves" classical channel Wpc induced from W by 
performing the Fuchs-Caves measurement on every channel output: 

M^FC : X PY\x{y\x) = {y\px\y), (46) 



where the orthonormal basis {\y)} is the same as that in (13 1. The specification of the 
polar code in the previous section specializes to this induced classical channel. The code 
consists of a set of "good" synthesized channels Gn{Wfc, l^) and "bad" synthesized channels 
Bn{Wfc,I3), where 

g^iWFcl^) ^{^e [N] : F{W^l^) = Z{W^l^) < 2-^'}, (47) 
Bn{Wfc,P) = [N] \ GNiWFcP), (48) 

and the equality F{Wp^ jy) = ^(^fc n) holds because the induced channels are classical. 
Furthermore, by Theorem |3] the number of good channels in the limit that N becomes large 
is as follows: 

lim h5N{WFC,m = nWFc)- (49) 

A^— >oo iV 

Finally, each bit of this classical polar code can be decoded in time 0{N) using a recursive 
calculation of likelihood ratios as given in (75)-(76) of Ref. [10 

Now, our main observation is the following relationship between the good channels of Wfc 
and the good channels of W: 

Gn{Wfc,P)'^Gn{W,P). (50) 



This relationship holds because of the Fuchs-Caves formula from (11 1. For all i, we have that 



F{W';>) = min Z{W';>,A) < Z{W^'> ,,), (51) 

where the inequality follows because the tensor-product Fuchs-Caves measurement that 
induces the synthesized channel Wp^ ^ is a particular kind of measurement, and so its 
classical statistical overlap can only be larger than that realized by the optimal measurement 



Note that this is the "first decoding algorithm" of Arikan. A refinement impHes that aU of the bits can 
be decoded in time 0{N log N), but the first decoding algorithm is what we will use in this work. 
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(which in general will be a collective measurement rather than a product measurement). 
Now, for all i £ Gn{Wfc, P), we have that 

Z{W^l^)<2-''\ (52) 



and can conclude ( 50 1 



This in turn implies that F{wl^^) < 2 by (51 1, and so for this i, we have that i € Gn^W, (3) 



This observation has non-trivial implications for the structure of the polar decoder. For all 
of the bits in Qn{Wyc^ P)j the receiver can decode them with what amounts to an effectively 
"product" or "non-collective" strategyj^ while for the bits in ^Ar(VF, /3) \ GnO^fcj l^), we 
still require collective measurements in order for the receiver to decode them with the error 



probabihty guarantee given by (311. However, when decoding the bits in Gn{VVfcj f^), the 
receiver should be careful to decode them in the least destructive way possible so that Sen's 
non-commutative union bound is still applicable and we obtain the overall error bound 
guaranteed by Proposition [4] In particular, the decoder should begin by performing an 
isometric extension of the Fuchs-Caves measurement on each channel output: 



where the orthonormal basis {\y)} is from the eigendecomposition in (13 1 and the basis {|Ay)} 
encodes the eigenvalues to some finite precision. Such an operation coherently copies the 
likelihood ratios Aj, of the Fuchs-Caves measurement into an ancillary register. The receiver 
then performs a reversible implementation of Arikan's decoding algorithm to process these 
likelihood ratios according to (75)-(76) of Ref. [T]. Finally, the receiver coherently copies 
the value of a single decision qubit with a CNOT gate to an ancillary register, measures the 
decision qubit, and "uncomputes" these operations by performing the inverse of the Arikan 
circuit and the inverse of the operations in (53 1. Figure [l] depicts these operations. The 
effect of these operations is to implement a projection of the channel output onto a subspace 
spanned by eigenvectors \y^) = \yi) ® • • • ® |?/Ar) of the Fuchs-Caves measurements such that 

W^lAy'',u\-'\0) > W^l^{y^,u\~'\l), (54) 
or onto the complementary subspace spanned by eigenvectors |y^) such that 

W^l^{y^,ul'\0) < W^l^{y^,u\-'\l), (55) 
where y^ is the classical output of the Fuchs-Caves channel and u]~^ denotes the previously 



decoded bits. Thus, the fidelity bound from (52 1 is applicable and Sen's non-commutative 



union bound guarantees that the overall contribution of the error in decoding bit i e 
Gn{Wfc, 13) is no larger than 2^^ . The time that it takes to process each bit i € Gn{Wfc, P) 
is 0{N), which is clear from the structure of the circuit and Arikan's "first decoding 
algorithm." 

For all of the remaining bits i G Gn{W, f3)\GN{WFCi /3), we still do not know whether there 
exists an efficient quantum algorithm for decoding them while having the error probability 
from Proposition |4] Thus, for now, we simply suggest for the receiver to use collective 
measurements to recover them. 



^ If a decoding strategy amounts to coherent implementations of product measurements followed by 
coherent processing of the outcomes, we still say that it is a product strategy rather than collective. 
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Figure 1 The circuit for recovering an information bit in the set Qn{Wyc, P)- The encoder output 



is fed into A'' instances of the channel W. The receiver acts with A'' of the unitaries in (531, labeled 
as "FC" boxes which coherently copy the likelihood ratios Xy^ , ■ ■ ■ , Aj,„ into ancillary registers. The 
receiver then acts with a reversible implementation of Arikan's likelihood ratio computations, copies 
the decision bit into an ancillary register, and measures the decision bit to decode the i**" bit. The 
receiver finally performs the inverse of these operations to "clean up," i.e., to ensure that the next 
measurement can be performed, whether it be to decode a bit in the set Gn{Wfc, P) or the set 
Gn{W, j3) \ Gn{Wfc,P)- The effect of this circuit is to perform the desired "gentle projection." 



It should be clear from Proposition [3] and (49 1 that the size of the set Qn{W,[5) \ 
Gn{W-pctP) in the Hmit is equal to 

lim ^\gN{W,p)\gN{Wvc.P)\^I{W)-I{Wvc)- (56) 

This makes it clear that one does not require a collective strategy in order to recover all of 
the information bits, but a collective strategy is only required in order to bridge the gap 
between I{Wfc) and I{W). 

Observe also that similar reasoning applies to any product measurement, not just the 
Fuchs-Caves measurements (we focused on the Fuchs-Caves measurement due to its strong 
analogy with a likelihood ratio and because Arikan's decoding algorithm processes likelihood 
ratios). With this in mind, we could simply choose the product measurement to be the one 
that maximizes the accessible information, in order to maximize the number of bits that 
can be processed efficiently. Let Wacc be the classical channel induced by performing the 
measurement that maximizes the accessible information. One would then process the bits in 
Qn{Wi^cct P) in a way very similar as described above. All of the observations above then 
justify Claim [T] 

The reasoning also leads to a generalization of Lemma [2] that applies when using Fuchs- 
Caves measurements to distinguish a tensor-product state from pf^. The test consists 
of performing product measurements followed by classical post-processing. If one wishes to 
perform this test in the most delicate way possible, one could perform it as in Figure [T] 

► Lemma 5. When using product Fuchs-Caves measurements to distinguish p®^ from pf^, 
the probability of error pe is bounded from above in terms of the quantum fidelity F{po,pi): 

Pe<hF{po,pi)f. (57) 
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10 10 10 

Mean photon number 



W Figure 2 The fraction of collective measurements required for a polar decoder plotted as a 
function of the mean photon number E at the receiving end, when using a BPSK coding strategy. 



Proof. The proof is very similar to the proof of Lemma [2] The test, though, consists of 
performing individual Fuchs-Caves measurements on the N systems, and these tests result 



in likelihood ratios A^^, . . . , Aj,„. The decision rule is then as follows: 
decide ^ if x • • • x A„„ > 1, 



decide pf^ if A^, x 



X >'VN < 1- 



(58) 
(59) 



An analysis proceeding exactly as in (21)-(26l leads to the following bound 



2pe{w)< J2 [iyi\---{yN\pT\yi)---\yN)V/' [(yil---(yivlpnyi)---M]^/' 

yi,---,yN 

= E iyilPilyiV^' ■ ■ ■ iyNlPilvNy/' {yilPolyiV^' ■ ■ ■ ivNlPolyNy^' (60) 

yi,---,yN 

= J2(y^\p^\y^)'^'(y^\p"\y^y^' ■ ■ T.(y^\p^\y^y^'(y^\po\yN)'^' 



= [F{po,Pi)] 



N 



(62) 



Furthermore, one can implement this test efficiently and non-destructively on a quantum 
computer as described in Figure [T] The result is to project onto two different subspaces: the 
one spanned by eigenvectors whose corresponding eigenvalues satisfy (58 1 and the other. 



f Decoding the Pure-Loss Bosonic Channel 

A channel of particular practical interest is the pure-loss bosonic channel. A simple physical 
model for this channel is a beamsplitter of transmissivity rj G [0,1], where the sender 
has access to one input port, the environment injects the vacuum state into the other 
input port, the receiver has access to one output port, and the environment obtains the 
other output port. It is well known that the Holevo capacity of this channel is equal to 
g{riNs) = {rjNs + 1) log(77A^s + 1) — V^s ^og{riNs) where Ns is the mean input photon 
number. In the low-photon number regime, one can come very close to achieving the 
capacity by employing a binary phase-shift keying (BPSK) strategy (using coherent states 



M. M. Wilde, O. Landon-Cardinal, and P. Hayden 



\a) and \~a) as the signaling states) [21J. The BPSK strategy induces a cq channel of 
the following form: x —?' |(— 1)^q:)((— 1)^q;|. The symmetric Holevo rate for this channel is 
equal to x(^) = ^2([l + e~^^]/2), where h2 is the binary entropy and E = r/iVg. If the 
receiver performs a Helstrom measurement at every channel output, this induces a classical 
channel with symmetric mutual information equal to /hci(£') = 1 — /i2([l ~ \/l — e^'^^]/2). 
(See Ref. [9j, for example, for explicit calculations.) Our results in the previous section 
demonstrate that the fraction of information bits required to be decoded using a collective 
strategy is equal to 1 — Ib_ci{E) / x{E) . Figure [2] reveals that this fraction is rather small 
for mean photon number (MPN) larger than one, but then it rises sharply as we enter a 
quantum regime where the MPN is less than one. Even deep in the quantum regime at a 
MPN of 10"^, however, roughly 10% of the bits do not require collective decoding. 

1 6 Small Blocklength Polar Decoders 

This section briefly discusses how the Helstrom measurements [TTl [T^ in the quantum 
successive cancellation decoder from Ref. [23] decompose for very small size polar codes. 

6.1 Two-Bit Polar Decoder 

We begin by considering the simple two-bit polar code. The channel is of the form x — >■ Px, 
where x € {0, 1} and px is some conditional density operator. The two-bit polar code 
performs the simple transformation on the input bits ui and U2'- 

(ui,U2) -> (Ul -t-U2,U2), (63) 

where addition is modulo 2. 

The first step of the successive cancellation decoder is to recover ui , assuming that bit U2 
is chosen uniformly at random. The optimal measurement is a Helstrom measurement, and 
in this case, it amounts to distinguishing between the following two states 

^^PU2<» PU2, ^^PU2 + 1<^ PU2- (64) 

The Helstrom measurement is given by the projector onto the positive eigenspace of the 
difference of the two density operators above: 



\\^PU2® Pu2-\^ PU2 + 1 ® PU2>^ \ = \ 'Y^iPu 
I "2 "2 J I "2 

= \ E^^^)^'{PO-Pl)®Pu. 

K "2 



Pu2+i)®Pu2 > 0| (65) 
> \ (66) 



= |(P0-pi)®^(-l)"Vn2 >o| (67) 

= {(Po-pi)®(po-pi)>0}. (68) 

Thus, this test factorizes into the parity of the individual quantum hypothesis tests {{pq — Pi) > 
That is, supposing that n+ = {(po — Pi) > 0} and n_ = {(po — pi) < 0}, one can write the 
two-bit test as the product of two controlled gates 

Ui = Ib, ® (n+)B, (E}Ia + Ib, «) (H_)b, (g) {(Jx)a, (69) 
U2 = (H+)b^ (8) Ib2 <»Ia + {^-)b, ® Ib2 «) {crx)A, (70) 
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where Bi is the first channel output, B2 is the second channel output, and A is an ancillary 
system initialized to the state |0). The product of these two unitary gates is equal to 

U1U2 = ((n+)Bi ® (n+)B, + (n_)B, ® (n-)Bj ® Ia+ 

((n_)B, ® (n+)s, + (n+)B, ® (n_)Bj ® iax)A- (7i) 

The receiver would then measure the ancillary system A in order to make a decision about ui . 

Next, we determine the decoding of U2, given that ui has already been decoded. By the 
definition of the polar encoder transformation in (63 1, the goal is to distinguish between the 
following two states: 

Pui®Po, Pm+i®Pi- (72) 

The optimal quantum hypothesis test is given by the following projector: 

{Pui i^ipo -Pui+i fXipi > 0}. (73) 

This optimal quantum hypothesis test is not factorizable into smaller tests, and indeed, it 
is necessary to perform a collective measurement in order to implement it. Nonetheless, 
Lemma[5]provides a simple implementation of the Fuchs-Caves measurement for distinguishing 
these two states. 



6.2 Four-Bit Polar Decoder 

We now consider the form of Helstrom measurements for a four-bit polar code. Recall that 
the input transformation for the four-bit polar code is as follows: 

(m1,U2,W3,'"4) [ui +U2+UZ + Ui,Uz + Ui,U2 + U4,'U4)- (74) 

It is straightforward to find the form of the four different tests for decoding ui through U4. 
(See the appendix for derivations.) The test for decoding ui is again a parity test: 

{{po~Pir'>0}. (75) 

The test for decoding U2 given wi is 




(76) 

It remains unclear to us if there is a simple way to decompose the above test any further 
into non-collective actions (or even approximately using, e.g., the Fuchs-Caves measurement). 
The test for decoding U3 given U2 and wi is 

{(/3ni+«2 ® PO - P«i+«2 + l ® Pi) ® (P«2 ® PO - PU2 + 1 ® Pi) > 0}. (77) 

One could actually approximate this test "efficiently" by performing a product Fuchs-Caves 
measurement of the first two systems, a product Fuchs-Caves measurement of the last two, 
and then take the parity of the results of these two tests (of course implementing these tests 
coherently). The final Helstrom test for decoding M4 given U3, U2, and ui is 

{Pm+U2+Us ^ Pii3 '9 PU2 PO - P«l+«2+«3 + l P«3 + l P"2 + l Pi > 0}. (78) 

Clearly, it would be better to perform this last test by processing the likelihood ratios 
resulting from individual Fuchs-Caves measurements, rather than performing the optimal 
collective Helstrom measurement. 
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6.3 Polar Decoder for Larger Blocklengths 

One can continue in the above fashion to determine the form of a quantum successive 
cancellation decoder that recovers each bit of an eight-bit polar code. We again try to 
simplify each Helstrom measurement and provide an expression for each one in Appendix IS] 



A few tests simplify in particular those used to recover the first bit ui (Eq. (101 1), the fifth 



bit M5 (Eq. (|108l, the seventh bit M7 (Eq. (llll), and the last bit (Eq. (113)). However 



for the other tests, it is unclear if they can be approximated by some combination of Helstrom 
and Fuchs-Caves measurements, followed by coherent post-processing. 

From considering the eight-bit polar decoder, we can make several observations. For any 
blocklength, it is always possible to recover the first bit efficiently by calculating the parity of 
individual Helstrom measurements (though, this bit is always the "worst" bit, so the receiver 
would never actually be decoding it in practice). The receiver can always recover the last 
bit by performing a Fuchs-Caves measurement (this is always the "best" bit, so this should 
already be evident from the main observation in this paper). Furthermore, there are many 
bits that can be recovered by first performing Fuchs-Caves measurements, followed by the 
parity of these tests. Unfortunately, the fraction of these tests tends to zero in the limit of 
large blocklength. Thus, there still remains much to understand regarding the structure of a 
polar decoder. 



^ Conclusion 

The main result of this paper is an advance over previous schemes for decoding classical 
information transmitted over channels with classical inputs and quantum outputs. In 
particular, we have shown that N ■ /(Wacc) of the information bits can be decoded reliably 
and efficiently on a quantum computer by a "non-collective" coherent decoding strategy, 
while closing the gap to the Holevo information rate (decoding the other N{I{W) — /(Wacc)) 
bits) should require a collective strategy. For the pure-loss bosonic channel, this implies 
that the majority of the bits transmitted can be decoded by a product strategy whenever 
the mean photon number is larger than one, while the fraction of collective measurements 
required increases sharply as the mean photon number decreases below one, marking the 
beginning of the quantum regime. Remarkably, even at mean photon numbers as low as 10~*, 
roughly 10% of the bits do not require collective decoding, however. As another contribution, 
we have shown that a receiver can also employ collective Fuchs-Caves measurements when 
decoding a classical-quantum polar code. Finally, we gave the explicit form of the Helstrom 
measurements of a quantum successive cancellation decoder for two-, four-, and eight-bit 
polar codes. This should be helpful in determining the explicit form of tests for larger 
blocklength polar codes. 

The main open question is still to determine whether all of the information bits can be 
efficiently decoded on a quantum computer. To answer this question, one might consider 
employing the Schur transform [S] 1101 3] and exploiting the structure inherent in polar codes. 
Unfortunately, it is not clear to us that this approach will lead to a quantum successive 
cancellation decoder with time complexity O(A^logiV) because the complexity of the Schur 
transform is higher than this. 

We acknowledge helpful discussions with Frederic Dupuis, Saikat Guha, Hari Krovi, David 
Poulin, and Joseph Renes. MMW acknowledges support from Montreal's Centre de Recherches 
Mathcmatiques. OLC aknowledges support from NSERC through a Vanier scholarship. PH 
acknowledges support from the Canada Research Chairs program, the Perimeter Institute, 
CIFAR, FQRNT's INTRIQ, NSERC, and ONR through grant N000140811249. 
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I A Derivations for the Four-Bit Polar Decoder Measurements 

The four-bit polar encoder amounts to the following transformation: 

(Ul, 1*2, M3, M4) {Ui + 1*2 + U3 + U4, U3 + U4, U2 + Ui, M4). (79) 



A.l Recovering ui 

Let us first determine how the quantum successive cancellation decoder (QSCD) recovers 
the bit ui, assuming that 1*2, U3, and M4 are chosen uniformly at random. The test aims to 
distinguish between the following two states: 



^ E P"2+«3-(-«4 ® P«3+«4 ® PU2+U4 » PU4, (80) 

23 E PU2+U3+U4 + 1'^ Pu3+Ui'^ Pu2+Ui<^ PU4, (81) 
U2,U3,U4 

and it performs the following projection: 

{PU2+U3+U4 — Pu2+U3+U4 + l) ^ PU3+U4 'S) Ptt2-|-"4 P"4 — ^ f 
U2,U3,U4 ) 

= 1 E (-ir+"^+"Hpo-Pi)®Pn3+ 

l,lt2,M3,"4 

= |(P0-Pl)® (-l)"'''"'^"Vn3+«4®P«2+«4®P^.4 

I "2, 1*3, "4 J 

= I (PO - Pi) ® J2 (-l)"^+"V.3+«4 ® (-l)"^+"V«2+n4 ® (-1)"V«4 > } (84) 
I. U2,«3,«4 J 



U4 yy Pu2+U4 Pu4>0} (82) 

(83) 
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= j(po-pi)0 E ® ® (-ir^P«i > [ (85) 

I, "2 "3 "1 J 

= {(Po - Pi) O (po - Pi) O (po - Pi) (po - Pi) > 0}. (87) 
Thus, this first test nicely factors as the parity of the four individual tests {{po — pi) > 0}. 

A. 2 Recovering U2 given ui 

We now determine how the quantum successive cancellation decoder recovers U2 given ui, 
while randomizing over us and U4. The aim is to distinguish between the following two 
states: 

^ P«l+U3+«4 ® Pu3+«4 ® P«4 P«4, (88) 

«3,«4 



^ E P«l+l+"3+«4 <2) Pu3+«4 ® Pl+„4 O P„4, (89) 

M3,«4 

which translates to a projection of the following form: 

S E ^«l+"3+«4 ® Pu3+«4 ® P«4 ® P«4 - Pui + l+U3+«4 ® P«3+«4 ® PI+U4 ® P«4 > (90) 
l.«3,«4 J 

Define ^3 = ^3 + U4 and the above becomes 

E P«l+«3 ® P«3 ® P«4 ® P«4 - Pui+l+U^ O P^ O Pl+«4 O P«4 > 

(91) 



A. 3 Recovering 1*3 given U2 and 

Let us determine how the QSCD recovers u-^ given 11,2 and Ui, while randomizing over U4. 
The test distinguishes between the following two states: 



\ E P«l+«2+«4 ^ P«4 <8) P«2+«4 <8) Pm4) (92) 
U4 

^ E P«l+«2 + l+U4 ® PI+U4 ® Pu2+«4 ® P«4> (93) 
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and amounts to a projector of the following form: 

K U4 U4 ) 

= \ ^(p„i+„2+„4 <E) Pu4 - Pui+U2+1 + U4 ® Pl+«4) ^ Pu2+«4 Pu4 > ^ \ (94) 
I J14 ) 

= ^^^i-^)"" {Pui+U2 ® PO- Pui+U2 + 1 ® Pi) «> Pu2+«4 ® PU4 > (95) 

= < (p„i+„2 ® P0~ Pm+U2 + 1 ® Pi) ^ ^(-l)"V„2+„4 «) /0„4 > l (96) 
I U4 ) 

= {(P«l+U2 «> PO - Pui+U2 + 1 Pi) «) (p„2 PO - P«2 + l Pi) > 0}- (97) 

Thus, this test nicely factorizes as the parity of two tests {(pui+M2 Po ~ Ptti+«2+i ® Pi) ^0} 
and {{pu2 <»Po- P«2+i ® Pi) > 0}- 

A. 4 Recovering U4 given M3, U2, and mi 

Finally, we determine how the QSCD recovers M4 given all of the previous bits. The test in 
this case just aims to distinguish the following states: 

Pu^+U2+U3 ^ Pus P«2 PO, (98) 
Pni+«2+M3 + l Pu3 + 1 P«2 + l Plj (99) 

and amounts to the following projection: 

{Pui+U2+U3 fX" P„3 (Xl Pu2 Pa- Pui+U2+"3 + l ^ P"3 + l ^ P"2 + l Pi > 0}- (100) 

Measurements for the Eight-Bit Polar Decoder 

Here, we provide the form of a quantum successive cancellation decoder that recovers each 
bit of an eight-bit polar code. Full derivations of the results in this section are available from 
the authors upon request. 

B. l Recovering ui 

The test to recover the first bit ui is simply the parity of eight individual Helstrom measure- 
ments: 

{(Po-Pi)^'>0}. (101) 
B.2 Recovering U2 given ui 

The test to recover bit U2 given ui projects onto the positive eigenspace of the difference of 



Pu,+u'^+u'^+u', «> Pu'^ 'S) Pu'^ 'S) Pu'A 'S) \ Pu'^+K+u'^ Pu'g Pui, «> Pu'^ 



(102) 
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J2u' p«i+«2+i+< Pu') ® (E«'. Pi+u; ^' Pu' ) > 



and 

p„j+„^+„^+„^ (g) Pu;^ P„U ^ p„^+„^+„^ (g) (g) (g . 

(103) 

As such, it is not clear to us how one could approximate this test as some combination of 
Helstrom and Puchs-Caves tests. 

B.3 Recovering ^3 given U2, and ui 

The test to recover bit U3 given m and U2 is equal to the parity of the following two tests: 

(104) 

"rj - " J 

(E< Pu,+u', ®Pu'^)^ {Eu'^ Pu'i ® P<) 1 ^^^^^ 

- (E„^ Pu^+u', + 1 ® Pu'^ ® (E„^' Pl+u- ® Pu-) > J 

It is again unclear to us how to decompose this measurement further. 
B.4 Recovering U4 given U3, U2, and ui 

The test to recover bit U4 given ui, U2, and U3 projects onto the positive eigenspace of the 
difference of 

J2 Pui+U2+U3+U'^ Pu'}\ <^ P^3+K '^Pu'e]'^ P^2+U'^ ^ Pu'^] «) P< P< 

(106) 

and 



(107) 



Again, this one remains unclear how to decompose further. 

B.5 Recovering given U4 ui 

The test to recover bit U5 given m through U4 is equal to 



(P«l+«2+«3+«4 PO — P«l+«2+M3+«4 + l "^l Pi) (g (p„3+„4 (g PO " P«3+M4 + l Pi) 
2+M4 O PO - P«2+«4 + l <2) Pi) (g (P„4 (g PO - P«4 + l O Pi) > 



. (108) 

It is easy to see that one could approximate this test by first performing four Fuchs-Caves 
measurements on adjacent pairs of channel outputs and taking the parity of these tests. 
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B.6 Recovering uq given ui 

The test to recover bit uq given ui through is a projection onto the positive eigenspace of 
the difference of 



(109) 



and 



(110) 

A simple decomposition of this test remains unclear. 

B.7 Recovering uj given uq ui 

The test for recovering bit uy given the previous ones is 

(Pu2+U4+«6 ^ P"6 Pu4 (8) PO - P«2+U4 + «8 + l ^ ^"6 + 1 P«4 + l ^ Pi) > 

(111) 

which is clearly implementable by performing a Fuchs-Caves measurement on the first four 
qubits and the last four, and than taking the parity of these two tests. 

B.8 Recovering us given uj ui 

The final test for recovering the last bit wg given all others is a projection onto the positive 
eigenspace of the difference of 

Pui+---+m ® PU5+U6+M7 ® P«3+U4 + «7 ® P«7 ® P«2+U4+«6 ® Pv-e ® P«4 ® PO , (112) 

and 

PmiH htl7 + l ^ Pu.^+ue+U7 + l® Pu3+Ui+U7 + l<i^ Pm + l^ Pu2+Ui+ue + l^ Pue + 1^ Pu4 + 1^ Pi- (113) 

It is clear that we can approximate this test with a Fuchs-Caves measurement. 



